Search CORE

17 research outputs found

Zero-shot language transfer for cross-lingual sentence retrieval using bidirectional attention model

Author: A Moro
DE Losada
DS Munteanu
DW Oard
GA Levow
I Vulić
L Ballesteros
P Bojanowski
P Resnik
P Sorg
R Navigli
S Hochreiter
S Robertson
SA Rauf
Publication venue: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publication date: 01/01/2019
Field of study

We present a neural architecture for cross-lingual mate sentence retrieval which encodes sentences in a joint multilingual space and learns to distinguish true translation pairs from semantically related sentences across languages. The proposed model combines a recurrent sequence encoder with a bidirectional attention layer and an intra-sentence attention mechanism. This way the final fixed-size sentence representations in each training sentence pair depend on the selection of contextualized token representations from the other sentence. The representations of both sentences are then combined using the bilinear product function to predict the relevance score. We show that, coupled with a shared multilingual word embedding space, the proposed model strongly outperforms unsupervised cross-lingual ranking functions, and that further boosts can be achieved by combining the two approaches. Most importantly, we demonstrate the model's effectiveness in zero-shot language transfer settings: our multilingual framework boosts cross-lingual sentence retrieval performance for unseen language pairs without any training examples. This enables robust cross-lingual sentence retrieval also for pairs of resource-lean languages, without any parallel data

Crossref

MAnnheim DOCument Server

Apollo (Cambridge)

Information Retrieval under Constricted Bandwidth

Author: DW Oard
DW Oard
G Salton
RS Taylor
S Curt
T Howard
W Douglas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2003
Field of study

Crossref

Cross-language patent matching via an international patent classification-based concept bridge

Author: Brown PF
Hlava M
Karanikolas N
Loukachevitch NV
Oard DW
Petrelli D
Publication venue: 'SAGE Publications'
Publication date
Field of study

Crossref

Relevance Measures Using Geographic Scopes and Types

Author: Andogah Geoffrey
Bouma Gosse
Jikoun
Mandl T
Muller H
Oard DW
Penas A
Peters C
Petras
Santos D
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2008
Field of study

This paper proposes two kinds of relevance measures to rank documents by geographic restriction: scope-based and type-based. The non-geographic and geographic relevance scores are combined using a weighted harmonic mean. The proposed relevance measures and weighting schemes are evaluated on GeoCLEF 2007 dataset with encouraging performance over the standard IR performance. The best performance is achieved when the importance of non-geographic relevance scores outweigh the importance of geographic relevance scores

Content-Based Image Retrieval Using Combined 2D Attribute Pattern Spectra

Author: Jikoun
Mandl T
Muller H
Oard DW
Penas A
Peters C
Petras
Santos D
Tushabe Florence
Wilkinson Michael. H.F.
Publication venue: University of Groningen, Johann Bernoulli Institute for Mathematics and Computer Science
Publication date: 01/01/2008
Field of study

This work proposes a region-based shape signature that uses a combination of three different types of pattern spectra. The proposed method is inspired by the connected shape filter proposed by Urbach et al. We extract pattern spectra from the red, green and blue color bands of an image then incorporate machine learning techniques for application in photographic image retrieval. Our experiments show that the combined pattern spectrum gives an improvement of approximately 30% in terms of mean average precision and precision at 20 with respect to Urbach et al’s method

Automatic transcription of Czech language oral history in the MALACH project: resources and initial experiments

Author: Byrne WJ
Demner Fushman D
Dorr B
Gustman S
Hajic J
Oard DW
Picheny M
Ramabhadran B
Resnik P
Soergel D
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/09/2002
Field of study

CUED - Cambridge University Engineering Department

Reranking Hypotheses of Machine-Translated Queries for Cross-Lingual Information Retrieval

Author: A Fujii
AR Aronson
B Herbert
BL Humphreys
C Macdonald
D Hiemstra
DW Oard
K Crammer
L Goeuriot
P Pecina
PL Schuyler
Publication venue
Publication date: 01/01/2016
Field of study

Machine Translation (MT) systems employed to translate queries for Cross-Lingual Information Retrieval typically produce single translation with maximum translation quality. This, however, might not be optimal with respect to retrieval quality and other translation variants might lead to better retrieval results. In this paper, we explore a method exploiting multiple translations produced by an MT system, which are reranked using a supervised machine-learning method trained to directly optimize the retrieval quality. We experiment with various types of features and the results obtained on the medical-domain test collection from the CLEF eHealth Lab series show significant improvement of retrieval quality compared to a system using single translation provided by MT

Crossref

Biblio at Institute of Formal and Applied Linguistics

Replicating Relevance-Ranked Synonym Discovery in a New Language and Domain

Author: AD Lucia
C Carpineto
DW Oard
GW Furnas
J Nivre
L Zhang
M Braschler
M Stanojević
O Levy
P Bojanowski
R Östling
Steven N. Goodman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Crossref

MPG.PuRe

Extracting Bimodal Representations for Language-Based Image Retrieval

Author: AWM Smeulders
D Hiemstra
D Hiemstra
DW Oard
F Jong de
GE Forsythe
K Netter
M Brachsler
M Cascia
M Flickner
M Marsicoi
S Deerwester
ST Dumais
T Gevers
W Kraaij
Y Yang
Publication venue: Springer-Verlag
Publication date: 01/01/2000
Field of study

This paper explores two approaches to multimedia indexing that might contribute to the advancement of text-based conceptual search for pictorial information. Insights from relatively mature retrieval areas (spoken document retrieval and cross-language retrieval) are taken as a starting point for an investigation of the usefulness of the concept of bimodal dictionaries and of clustering features from multi-modal documents into one semantic space. One of the advantages of the presented techniques is that they are domain independent. 1 Introduction Among the various types of objects that one could want to search for in the multimedia domain, image content seems to be one of the more challenging types. Speedy and easy access to image content in the general domain is not supported by today's search tools and technology, and in spite of progress in content based image retrieval or advances in the area of video logging (a technique which reuses subtitles or speech transcripts for the..

CiteSeerX

Crossref

Radboud Repository

Extracting bimodal representations for language-based image text retrieval

Author: AWM Smeulders
D Hiemstra
D Hiemstra
DW Oard
F Jong de
GE Forsythe
K Netter
M Brachsler
M Cascia
M Flickner
M Marsicoi
S Deerwester
ST Dumais
T Gevers
W Kraaij
Y Yang
Publication venue: Springer
Publication date: 04/02/1999
Field of study

CiteSeerX

Crossref

Radboud Repository

University of Twente Research Information